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Abstract—In [doi; 10.1109/ICMEW.2014.68907111, a graph- 
based denoising is performed by projecting the noisy image to 
a lower dimensional Krylov snbspace of the graph Laplacian, 
constrncted nsing nonnegative weights determined by distances 
between image data corresponding to image pixels. We extend 
the constrnctlon of the graph Laplacian to the case, where 
some graph weights can be negative. Removing the positivity 
constraint provides a more accurate Inference of a graph model 
behind the data, and thus can improve quality of filters for 
graph-based signal processing, e.g., denoising, compared to the 
standard construction, without affecting the costs. 

I. Introduction 

Constructing efficient signal hlters is a fundamental prob¬ 
lem in signal processing with a vast literature; see, e.g., 
recent papers m, 0,0,11,0,0 and references there. 
A hlter can be described by a transformation F, often non¬ 
linear, of an input signal, represented by a vector x, into 
a hltered signal, represented by a vector F{x). We revisit 
some classical constructions of hlters aimed at signal noise 
reduction, with the emphasis on bilateral hlter, popular in 
image denoising 0, 0, 0, ifTOl . The goal of the hlter is 
signal smoothing, reducing a high oscillatory additive noise. 
The smoothing can be achieved by averaging, which can 
be typically interpreted as a low-pass hlter, minimizing the 
contribution in the hltered signal of highly oscillatory modes, 
treated as eigevectors of a graph Laplacian; see, e.g., im. 

It is desirable to preserve edges in the ideal noise-free 
signal, even at the costs of an increased PSNR. Edge¬ 
conscious hlters detect, often implicitly, the locations of 
the edges and attempt using less aggressive or anisotropic 
averaging at these locations. Eully automatic edge detection 
in a noisy signal is difficult, typically resulting in non¬ 
linear hlters, i.e. where the hltered vector F{x) depends non- 
linearly on the input vector x. However, it can be assisted 
by a guiding signal, having the edges in the same locations 
as in the ideal signal; see, e.g., 0, IfT^ . |[T3l . 

Graph signal processing, introducing eigenvectors of the 
graph Laplacian as natural extensions of the Eourier bases, 
sheds new light at image processing; see, e.g., m, M, 
Mi, ini- In ESI, graph-based hltering of noisy images is 
performed by directly computing a projection of the image to 
be hltered onto a lower dimensional Krylov subspace of the 
normalized graph Laplacian, constructed using nonnegative 
graph weights determined by distances between image data 
corresponding to image pixels. We extend the construction of 
the graph Laplacian to the case, where some weights can be 
negative, radically departing from the traditional assumption. 


II. Preliminaries 

Let us for simplicity hrst assume that the guiding signal, 
denoted by y, is available and can be used to reliably detect 
the locations of the edges and, most importantly, to determine 
the edge-conscious linear transformation (matrix) Fy such 
that the action of the hlter F{x) is given by the following 
matrix-vector product FyX = F{x). Having a specihc con¬ 
struction of the guided hlter matrix Fy as a function of y, one 
can dehne a self-guided non-linear hlter, e.g., as F^x, which 
can be applied iteratively, starting with the input signal vector 
xo as follows, Xi+i = F{xi), i = 0,1,... ,m; cf., e.g., E9l . 

Similarly, an iterative application of the linear guided 
hlter can be used, mathematically equivalent to applying the 
powers of the square matrix Fy, i.e. Xm = {FyY^ xq, thus 
naturally called the power method, which is an iterative form 
of PCA; see, e.g., Eo), E]. To avoid a re-normalization of 
the hltered signal, it is convenient to construct the matrix 
Fy in the form Fy = Dy^Wy, where entries of the square 
matrix Wy are called weighs. The matrix Dy is diagonal, 
made of row-sums of the matrix Wy, which are assumed to 
be non-zero. Thus, Dy^Wy multiplied by a column-vector 
of ones, gives again the column-vector of ones. 

Let us further assume that the matrix Wy is symmetric 
and that all the entries (weighs) in Wy are nonnegative. Lor 
signal denoising, the following observations are the most 
important. The right eigenvector vi of the matrix D~^Wy 
with the eigenvalue /ii = 1 is trivial, just made of ones, 
only affecting the signal offset. Since the iterative matrix 
Fy = Dy^Wy is diagonalizable, the power method gives 

Xra = {Fy)^ Xo = Ej /r™ (uj Dxo) Vj, (1) 

where 1 = |/ri| > \p. 2 \ > are the eigenvalues of the 
matrix D~^Wy corresponding to the eigenvectors vj scaled 
such that vf Dvj = Sij. The power method, according to E), 
suppresses contributions of the eigenvectors corresponding to 
the smallest eigenvalues. Thus, the matrix Wy needs to be 
constructed in such a way that these eigenvectors represent 
the noisy part of the input signal, while the other eigenvectors 
are edge-conscious; cf. anisotropic diffusion ll22l . ||2T1 . Il24l . 

Let us introduce the guiding Laplacian Ly = Dy — Wy 
and normalized Laplacian Dy^Ly = I — Dy^Wy matrices. 
In ESl . the power method E) is replaced with a projection of 
the image vector x to be denoised onto a lower dimensional 
Krylov subspace of the guiding normalized graph Laplacian 
Dy^Ly and implemented, e.g., using the Conjugate Gradient 
(CG) method; see, e.g., lEa, |l 26 l, IETI. 


Fig. 1. Discrete cosine transform low frequency modes. 


Fig. 2. Edge-preserving low frequency eigenmodes. 


III. Motivation 


One of the most popular edge-preserving denoising filters 
is the bilateral filter (BF), see, e.g., ll28l . ll2^ and references 
there, which takes the weighted average of the nearby pixels. 
The weights Wij may depend on spatial distances and signal 
data similarity, e.g.. 


Wij = exp 


2^^ ) 


exp 


2a2 ) 


( 2 ) 


where pi denotes the position of the pixel i, the value y\i] 
is the signal intensity, and and cr^. are filter parameters. 
To simplify the presentation and our arguments, we further 
assume that the signal is scalar on a one-dimensional uniform 
grid, setting without loss of generality the first multiplier in 
dU to be 1, and that the weights Wij are computed only for 
the nearest neighbors and set to zero otherwise. 

Let us start with a constant signal, where y\i] — y[j] = 0. 
Then, Wi-n = wn = wa+i = 1 and the graph Laplacian 
Ly = Dy — Wy Is tridlagonal matrix that has nonzero 
entries 1 and —1 in the first row, —1 and 1 in the last row, 
and [—1 2 — 1] in every other row. This is a standard three- 
point-stencil finite-difference approximation of the negative 
second derivative of functions with homogeneous Neumann 
boundary conditions, i.e., vanishing first derivatives at the end 
points of the interval. Its eigenvectors are the basis vectors of 
the discrete cosine transform; see the first five low frequency 
eigenmodes (the eigenvectors corresponding to the smallest 
eigenvalues) of Ly in Figure [T] As can be seen in Figure [T] 
all smooth low frequency eigenmodes turn flat at the end 
points of the interval, due to the Neumann conditions. 

The key observation is that the Laplacian row sums in 
the first and last rows vanish for any signal, according to 
the standard construction of the graph Laplacian, no matter 
what formulas for the weights are being used! Thus, any low 
pass filter based on low frequency eigenmodes of the graph 
Laplacian flattens the signal at the end points. 


Let us now use formula 0 for a piece-wise constant 
guiding signal y with the jump large enough to result in 
a small value Wii+i = Wi+n for some index i. The first 
five vectors of the corresponding Laplacian are shown in 
Figure |2] All the plotted in Figure |2] vectors are aware of the 
jump, representing an edge in our one-dimensional signal y, 
but they are also all flat on both sides of the edge! Such a 
flatness is expected to appear for any guiding signal y giving 
a small value wnj^i = Wi+ii. 

The presence of the flatness in the low frequency modes 
of the graph Laplacian Ly on both sides of the edge in 
the guiding signal y is easy to explain. When the value 
Wii+i = Wi+ii is small relative to other entries, the 
matrix Ly becomes nearly block diagonal, with two blocks, 
which approximate graph Laplacian matrices of the signal y 
restricted to sub-intervals of the signal domain to the left and 
to the right of the edge. 

The low frequency eigenmodes of the graph Laplacian Ly 
approximate combinations of the low frequency eigenmodes 
of the graph Laplacians on the sub-intervals. But each of the 
low frequency eigenmodes of the graph Laplacian on the sub¬ 
interval suffers from the flattening effect on both ends of the 
sub-interval, as explained above. Combined, it results in the 
flatness in the low frequency modes of the graph Laplacian 
Ly on both sides of the edge. For denoising, the flatness 
of the vectors determining the low-pass Alter may have a 
negative effect for self-guided denoising even of piece-wise 
constant signals, if the noise is large enough relative to the 
jump in the signal, as shown in Section |Vl 

The attentive reader notices that method O is based on 
Dy^Wy, related to the normalized graph Laplacian Dy^Ly, 
not the Laplacian Ly used in our arguments above. Although 
the diagonal matrix Dy is not a scalar identity, and so 
the eigenvectors of D~^Ly, not plotted here, and of Ly 
are different, the difference is not qualitative enough to 
noticeably change the figures and invalidate our explanation. 











Fig. 3. Edge-enhancing low frequency eigenmodes, small negative. 


IV. Negative weights in spectral graph 

PARTITIONING AND FOR SIGNAL EDGE ENHANCING 

The low frequency eigenmodes of the graph Laplacian 
play a fundamental role in spectral graph partitioning, which 
is one of the most popular tools for data clustering; see, e.g., 
Eol, El, El- A limitation of the conventional spectral 
clustering approach is embedded in its definition based on 
the weights of graph, which must be nonnegative, e.g., based 
on a distance measuring relative similarities of each pair of 
points in the dataset. For the dataset representing values of a 
signal, e.g., pixel values of an image, formula (|2]l is a typical 
example of determining the nonnegative weights, leading to 
the graph adjacency matrix Wy with nonnegative entries, as 
assumed in Section |II] and in all existing literature. 

In applications, data points may represent feature vectors 
or functions, allowing the use of correlation for their pairwise 
comparison. The correlation can be negative, or, more gener¬ 
ally, points in the dataset can be dissimilar, contrasting each 
other. In conventional spectral clustering, the only available 
possibility to handle such a case is to replace the anticorre¬ 
lation, i.e. negative correlation, of the data points with the 
uncorrelation, i.e. zero correlation. The replacement changes 
the corresponding negative entry in the graph adjacency 
matrix to zero, to enable the conventional spectral clustering 
to proceed, but nullifies a valid comparison. 

A common motivation of spectral clustering comes from 
analyzing a mechanical vibration model in a spring-mass 
system, where the masses that are tightly connected have 
a tendency to move synchronically in low-frequency free 
vibrations; e.g., El- Analyzing the signs of the compo¬ 
nents corresponding to different masses of the low-frequency 
vibration modes of the system allows one to determine 
the clusters. The mechanical vibration model may describe 
conventional clustering when all the springs are pre-tensed to 
create an attracting force between the masses. However, one 
can pre-tense some of the springs to create repulsive forces! 


Fig. 4. Edge-enhancing low frequency eigenmodes, more negative. 


In the context of data clustering formulated as graph parti¬ 
tioning, that corresponds to negative entries in the adjacency 
matrix. The negative entries in the adjacency matrix are not 
allowed in conventional graph spectral clustering. Neverthe¬ 
less, the model of mechanical vibrations of the spring-mass 
system with repulsive springs remains valid, motivating us 
to consider the effects of negative graph weights. 

In the spring-mass system, the masses, which are attracted, 
would move together synchronically in the same direction in 
low-frequency free vibrations, while the masses, which are 
repulsed, have the tendency to move synchronically in the 
opposite direction. Using negative, rather than zero, weights 
at the edge of the guiding signal y for the purposes of the 
low-pass filters thus is expected to repulse the flatness of 
low frequency eigenmodes of the graph Laplacian Ly on the 
opposite sides of the edge of the signal y, making the low 
frequency eigenmodes to be edge-enhancing, rather than just 
edge-preserving; cf. If34l on sharpening. 

Figures [3 and |4] demonstrate the effect of edge-enhancing, 
as a proof of concept. Both Figures [3] and |4] display the five 
eigenvectors for the five smallest eigenvalues of the same 
tridiagonal graph Laplacian as that corresponding to Figure 
|2]except that the small positive entry of the weights Wi i+i = 
Wi+ii for the same i is substituted by —0.05 in Figure[3and 
by —0.2 in Figure |4] The previously flat around the edge 
eigenmodes in Figure |2] are repelled in opposite directions 
on the opposite sides of the edge in Figures |3] and |4] 

Negative weights require caution, since even small changes 
dramatically alter the behaviors of the low frequency eigen¬ 
modes around the edge, as seen in Figures [3 and |4] Making 
the negative value more negative, we observe by comparing 
Figure [3 to Figure |4] that the leading eigenmode, displayed 
using the blue color in both figures, corresponding to the 
smallest nonzero eigenvalue forms a narrowing layer around 
the signal edge, while other eigenmodes become less affected 
by the change in the negative value. 











Fig. 5. Edge-presei'ving filtering. 


Fig. 6. Edge-enhancing filtering. 


V. Edge-enhancing filters 

In this section, as a proof of concept, we numerically test 
the proposed edge-enhancing filters on a toy one-dimensional 
example using the classical nonlinear self-guiding BE and 
a guided (by a noiseless signal) BE accelerated with a 
conjugate gradient (CG-BE) method, as suggested in M- 
The specihc CG algorithm used in our tests is as described 
in Algorithm [U 


Algorithm 1: Conjugate Gradient Guided Eilter 
Input: signal vector to be filtered xq, matrices Dy and 

Ly 

TQ - LyXQ 

for fc = 0,1,..., m — 1 do 
Sk = Dy^rk 

if fc = 0 then 

I Po = So 

else 

Pk = Sk + PkPk-i, where 

g ^ {sk,rk) 

end 


Qk - LyPk 

_ {sk,rk) 

{pk,qk) 


T/c-l-l — Xk -f CtfcPfc 
Tfc-f-l — Tk O^kQk 


end 

Output: hltered vector Xm 


The noise is additive Gaussian, and the noisy signal is 
displayed using grey dots. The nonzero weights are computed 
by (|2]) with aj, = 0.5 and cri = 0.1 only for j = i — 1, i, i + 
1, resulting in tridiagonal matrices W and L. BE is self- 
guided, with W and L recomputed on every iteration using 
the current approximation Xk to the hnal hltered signal Xm- 
CG-BE uses the hxed nonzero weights computed also by (|2]), 
but for the noiseless signal y resulting in the hxed tridiagonal 
matrices Wy and Ly. The number of iterations in BE, 100, 
and CG-BE, 15, is tuned to match the errors. Eormula dU puts 
ones on the main diagonal of W, thus for small positive or 
negative wa+i = Wi+n the matrix D is well-conditioned. 

Eigure |5] demonstrates the traditional approach with non¬ 
negative weighs. We observe, as discussed in Section |III] 
battening at the end points. Moreover, there is noticeable 
edge smoothing in all corners, due to a large noise and 
relatively small number of signal samples, despite of the use 
of the edge-preserving formula (|2]l. We set tuned negative 
graph weights —2 x 10“^, —10“^, —10“® for i = 100, 250, 
and 350 correspondingly, to obtain Eigure |6] which shows 
dramatic improvements both in terms of PSNR and edge 
matching, compared to Eigure |5] 

VI. Conclusion 

The proposed novel technology of negative graph weights 
allows designing edge enhancing hlters, as explained the¬ 
oretically and shown numerically for a synthetic example. 
Our future work concerns determining the optimal negative 
weights, testing the concept for image hltering, and exploring 
its advantages in spectral data clustering using correlations. 
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